AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Quantized Inference

# Quantized Inference

Llama 3.2 1B Instruct GGUF
Llama-3.2-1B-Instruct is a 1B-parameter instruction-fine-tuned model based on the Llama architecture, offering multiple quantization formats to accommodate different hardware requirements.
Large Language Model Supports Multiple Languages
L
Mungert
708
3
Mxbai Rerank Large V2 GGUF
Apache-2.0
mxbai-rerank-large-v2 is a multilingual text reranking model that supports multiple languages and various quantization formats, suitable for different hardware environments.
Text Embedding Supports Multiple Languages
M
Mungert
2,209
2
Gemmax2 28 2B 4bit
Apache-2.0
The GemmaX2-28-2B GGUF quantized model is a collection of quantized versions of the GemmaX2-28-2B-v0.1 translation large language model developed by Xiaomi, supporting machine translation tasks in 28 languages.
Machine Translation Transformers Supports Multiple Languages
G
Tonic
19
1
Whisperkit Pro
Other
WhisperKit Pro is the commercial version of WhisperKit, focusing on automatic speech recognition (ASR) tasks, supporting quantization technology for efficient speech processing.
Speech Recognition
W
argmaxinc
1,862
14
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase